Error recovery and sentence verification using statistical partial pattern tree for conversational speech
Authors
Abstract
To handle disfluencies in conversational speech, this paper proposes a partial pattern tree (PPT) and a PPT-based statistical language model. A partial pattern represents a sub-sentence consisting of a key-phrase and some optional/functional phrases. The PPT is an integrated tree structure built from the partial patterns generated from the training sentences, and it is used to model n-gram and grammatical constraints. A PPT merging algorithm is also proposed that reduces the number of partial patterns with similar syntactic structure by minimizing an objective cost function. Using the PPT, undetected or misdetected errors caused by disfluencies can be recovered. Finally, a sentence verification approach re-ranks the recovered sentences generated from the PPT. To assess performance, a faculty name inquiry system with 2583 names was implemented. With the proposed PPT, the system achieved a recognition accuracy of 77.23%. The method is also compared with previous conventional approaches to show its superior performance.
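The idea of a partial pattern (a key-phrase plus any subset of the optional/functional phrases) stored in a shared tree can be illustrated with a minimal sketch. This is not the authors' algorithm: the names `partial_patterns` and `PatternTree`, the example sentence, the `<NAME>` placeholder, and the simple occurrence counts are all assumptions made for illustration, and the merging step and statistical scoring described in the abstract are not modeled.

```python
from itertools import combinations


def partial_patterns(phrases, key_index):
    """Yield every order-preserving subsequence of `phrases` that keeps the key phrase.

    Hypothetical helper: every phrase other than the key phrase is treated as optional.
    """
    optional = [i for i in range(len(phrases)) if i != key_index]
    for r in range(len(optional) + 1):
        for kept in combinations(optional, r):
            indices = sorted(kept + (key_index,))
            yield tuple(phrases[i] for i in indices)


def new_node():
    # count: patterns passing through this node; end: patterns ending here
    return {"count": 0, "end": 0, "children": {}}


class PatternTree:
    """Prefix tree over phrase sequences (an illustrative stand-in for a PPT)."""

    def __init__(self):
        self.root = new_node()

    def insert(self, pattern):
        node = self.root
        node["count"] += 1
        for phrase in pattern:
            node = node["children"].setdefault(phrase, new_node())
            node["count"] += 1
        node["end"] += 1

    def contains(self, pattern):
        node = self.root
        for phrase in pattern:
            node = node["children"].get(phrase)
            if node is None:
                return False
        return node["end"] > 0


# Assumed training sentence, already segmented into phrases; index 2 is the key phrase.
sentence = ["please", "connect me to", "<NAME>", "thanks"]
tree = PatternTree()
for pattern in partial_patterns(sentence, key_index=2):
    tree.insert(pattern)

# A disfluent utterance that dropped "please" and "thanks" still matches a stored pattern.
print(tree.contains(("connect me to", "<NAME>")))
# A sequence without the key phrase does not.
print(tree.contains(("please", "thanks")))
```

In this sketch, counts stored on the nodes could serve as the starting point for n-gram-style probabilities, but how the paper actually estimates and merges them is not specified in the abstract.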
Similar resources
Recovery from false rejection using statistical partial pattern trees for sentence verification
In conversational speech recognition, recognizers are generally equipped with a keyword spotting capability to accommodate a variety of speaking styles. In addition, language model incorporation generally improves the recognition performance. In conversational speech keyword spotting, there are two types of errors, false alarm and false rejection. These two types of errors are not modeled in la...
Sub-sentence discourse models for conversational speech recognition
According to discourse theories in linguistics, conversational utterances possess an informational structure that partitions each sentence into two portions: a “given” and “new”. In this work, we explore this idea by building sub-sentence discourse language models for conversational speech recognition. The internal sentence structure is captured in statistical language modeling by training mult...
Prosodical sentence structure inference for natural conversational speech understanding
In order to develop a system capable of understanding natural conversational speech, along with the current developments in technology for phonetic information processing, a technology must be developed that will utilize prosodic information of natural speech. We propose here an algorithm for generating a parsing tree that represents the semantic relationships between phrases, based on an...
Dialogue act detection in error-prone spoken dialogue systems using partial sentence tree and latent dialogue act matrix
In a goal-oriented spoken dialogue system, the major aim of spoken language understanding is to detect the dialogue acts (DAs) embedded in a speaker’s utterance. However, error-prone speech recognition often degrades the performance of the SLU component. In this work, a DA detection approach using partial sentence trees (PSTs) and a latent dialogue act matrix (LDAM) is presented for spoken langu...
A Sharp Sufficient Condition for Sparsity Pattern Recovery
A sufficient number of linear and noisy measurements for exact and approximate sparsity pattern/support set recovery in the high-dimensional setting is derived. Although this problem has been addressed in the recent literature, there are still considerable gaps between those results and the exact limits of perfect support set recovery. To reduce this gap, in this paper, the sufficient con...
Journal title:
Volume, issue:
Pages: -
Publication date: 2000